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Abstract. The 2-parameter family of massive variants of Einstein's gravity (on a Minkowski 
background) found by Ogievetsky and Polubarinov by excluding lower spins can also be derived 
^ ■ using universal coupling. A Dirac-Bergmann constrained dynamics analysis seems not to have 

' been presented for these theories, the Freund-Maheshwari-Schonberg special case, or any other 

massive gravity beyond the linear level treated by Marzban, Whiting and van Dam. Here 
the Dirac-Bergmann apparatus is applied to these theories. A few remarks are made on the 
question of positive energy. Being bimetric, massive gravities have a causality puzzle, but it 
, appears soluble by the introduction and judicious use of gauge freedom. 

o 

Oh. 1. Universal Coupling and Massive Gravity: Historical Sketch 

Although the field approach to Einstein's equations is sometimes contrasted with the geometrical 
approach to gravitation (supposedly Einstein's), in fact core ideas of the field approach, such 
as the gravitation-electromagnetism analogy, universal coupling to a combined matter-gravity 
energy-momentum complex as source, and conservation (in the sense of an ordinary divergence) 
of energy- momentum due to the gravitational field equations alone, were employed by Einstein 
in the 1910s [1-4] in his quest for the gravitational field equations. The field approach enjoyed 
a revival in the 1950s- 70s [5], especially in works by Kraichnan [6], Gupta [7], Feynman [8], 
and Deser [9]. In the 1960s V. I. Ogievetsky and I. V. Polubarinov (OP) derived a 2-parameter 
family of Poincare-symmetric massive Einstein's equations. Universal coupling was not used, 
but the spin 1 degrees of freedom were removed from the interacting theory to avoid negative 
energies and one spin to preserve locality [10]. Thus spin 2 and one spin degrees of freedom 
remained. Independently, Freund, Maheshwari and Schonberg (FMS) derived perhaps the most 
attractive member of the OP family using universal coupling with a mass term included [11]. 
Previously the author and W. C. Schieve showed that two one-parameter subfamilies of OP 
theories are universally coupled [12]. Recently (unpublished) the author showed that all OP 
theories are universally coupled. Empirically, massive spin 2-spin gravity matches GR except 
for strong fields or large distances [13, 14]. The spin 0, which is repulsive, theoretically can have 
a mass anywhere between and oo, including both endpoints, so its phenomenology has some 
flexibility [14]. 

Two further questions arise: positive energy and causality. In the late 1930s Pauli and Fierz, 
working to linear order, argued that massive spin 2 theories ought to have no spin 0, because the 
latter was of the wrong sign and so threatened positive energy. While OP seem to have ignored 



this problem for their nonlinear theories, FMS suggested that subtle effects might render the 
wrong-sign spin harmless. In 1970 the van Dam-Veltman-Zakharov discontinuity was derived: 
at linear level, massive spin 2 gravity with no spin disagrees in the massless limit with GTR 
and, more seriously, with experiment [15,16]. Boulware and Deser argued [17] that massive 
gravity was a doomed enterprise because every theory suffered from either violation of positive 
energy (the spin 2-spin case) or empirical refutation (the spin 2 case), or both. This argument 
was widely but not quite universally accepted for over two decades. Since the mid-1990s both the 
empirical inadequacy of spin 2 theories [18] and the negative energy instability of spin 2-spin 
theories [13, 14] have been discussed anew. The former question is of little relevance here (except 
perhaps to the infinitely massive spin case). About the latter question I have little to say here, 
though elsewhere [12] I have suggested that nonperturbative features of the Hamiltonian should 
not be ignored and might be of some help. In any case, the spin 2-spin theories discussed here 
are viable only if the negative energy worry can be handled. While no one has proven that these 
theories are stable, recently Visser and Babak and Grishchuk have suggested that they might 
be [13, 14]. The issue of causality, which was not considered until the last two decades, will be 
discussed at the end. 



2. Ogievetsky-Polubarinov Massive Gravities 

Some mass terms are better motivated than others. The OP mass terms are well motivated. The 
author's recent rederivation of the OP massive gravities using universal coupling (extending that 
in [12] is based on the metrical rather than canonical stress tensor, along the lines of Kraichnan 
[6] and Deser [9] rather than Gupta [7] and FMS [11], so formal general covariance is achieved 
with a flat metric tensor (under arbitrary coordinate transformations) rjuv- (By contrast, OP 
used Cartesian coordinates with imaginary time.) The OP theories are best expressed using 
tensor densities of arbitrary real weight. The flat metric's weight — I covariant concomitant 
is fj^ = y/—r] ~ l r}^; its inverse, the weight / contravariant concomitant, is fj^ v = \J—r\ l r] fiu . 
The densitized curved metric g^ v and its inverse g^ v are analogously defined in terms of g^ u . 
In addition to the density weight /, the OP theories are parametrized by another parameter n 
(/ / \,n / 0). The two metrics are connected by the equation g^ v = rf v + A7 Mly , where 7^ is 
the (weight I) gravitational potential and A = —\/32irG. The parameter n gives the power (in 
the sense of a binomial series) to which g^ u is raised: 

WT = vT + nAf" + n{n ~ l) \ 2 l^ ap r u + ■■■ 

Thus, for example, [g pu ] 2 = g^ a fiap9 pu and [g^]' 1 = fj >ia 9apV pu ■ I n case this series diverges, 
these arbitrary real powers can be defined using a generalized eigenvalue formalism based on 
the Segre classification of g^ v with respect to r)^ v [19]. 

The action for the S for the OP theories, once stripped of parts that contribute nothing to 
the equations of motion, is 

J d 4 xV^R( 9 ) + S matter [g^,u] + J d 4 x(2^[2l - 1] - I^[sH%„). 

The mass term contains a formal cosmological constant term y/^g and the all-important new 
piece involving —\\[—Vj\g ilv \ l 'f] l iv, which ensures Yukawa exponential fall-off (rather than the 
increase with distance that a cosmological constant produces), makes the flat metric observable 
in principle (though empirically the graviton mass is tiny if nonzero), and destroys the gauge 
freedom. The terms in the action which do not contribute to the field equations include a 
constant term that ensures vanishing energy for Minkowski space-time and a divergence term. 
The case I = l,n = 1 gives the FMS theory, which has also been adopted by Logunov [20]. 



3. Constrained Dynamics of OP Massive Gravities 

While some Hamiltonian treatment of massive GR was given by Boulware and Deser [17] and 
the Dirac-Bergmann constrained dynamics formalism was applied to linearized massive GR by 
Marzban, Whiting, and van Dam [21], apparently no explicit constrained dynamics treatment of 
nonlinear massive variants of Einstein's equations has been given. Although Marzban, Whiting 
and van Dam found the linear spin 2 theory to have some nontrivial features in the constrained 
dynamics analysis, the linear spin 2-spin theories behave much as one would expect from 
the Proca massive electromagnetic analog with mass term —^m 2 A^A^. It is useful to see 
what differences an exact (nonlinear) treatment of gravity gives. I follow the standard GTR 
treatment [22]. Some terms that do not affect the field equations — constants and divergences — 
will be dropped from the Lagrangian density and an ADM split will be employed. For isolated 
sources without radiation, one expects exponential Yukawa falloff, so discarding boundary terms 
for massive theories is less dangerous than it is for GR. The notation of lapse N, shift vector 
(3 1 , curved spatial metric hij, and extrinsic curvature is employed, along with the pseudo- 
vectorial abbreviation = (N,/3 l ). I set 16irG = 1. For the flat metric, Cartesian coordinates 
are chosen so rj^ u = diag(—l, 1, 1, 1). The symbol « means equality on the constraint surface. 
The Lagrangian density from which the Hamiltonian density will be found is therefore 

/ — • ■ m 2 1 

C = NVh(K lJ Kij - K\K] + R\h l3 \) + ^_( 2> /=£[21 - 1] - -[sH'V). 

The conjugate momenta are the same as in GR: IT-? = P^( x ) = = are primary 

constraints. The generalized Legendre transformation gives the canonical Hamiltonian density 

... . ... . rn^ [ 

H = IP 3 ^ + P^W -£ = U^hij + -£ = NH + ifH % - ^-^-{2y/^[2l - 1] - -[sH"^), 

where Hq, Hi are the familiar functions of hij, W 3 , and their spatial derivatives. Using 

yf^^g = N\fh, one can absorb — ^Jg™ y/—9 into Ho without affecting the Dirac 'algebra' 
of Poisson brackets [22]. Having done so, I now write H m i for the leftover piece of the mass 
term, 32 wGri 2 [ff M "] n ^W- Thus 7i = NHo + (3 l Tli + T~L m i- As usual demanding the preservation of 
the primary constraints P M = gives secondary constraints |^ = Hq + , ^ = Hi + • 
For n = ±1, it is straightforward to treat H m i explicitly, but not in general. Preserving the 
(smeared) secondary constraints / d^x-^j I ^{x) involves the primary Hamiltonian J d 3 y(NTlo-\- 

f3 l TCi + TC m i + U^Pfj) with Lagrange multipliers U^. Now one uses the Dirac algebra. If qn»!)n v ' 
the coefficient of U^, is an invertible matrix, then no tertiary constraints arise. 

Are constraints all second class? The answer depends on the determinant of the matrix of 
Poisson brackets, 



det 



det 



{P ^dH {y)} ] T o -^- S (x,y) 

_{^{x),P u (y)} J lA.S^y) 



Thus the large difficult determinant reduces in effect to a smaller and simpler one. All 
constraints are second class if the matrix q^^u is invertible, which was the same condition 
that would exclude tertiary constraints. This matrix is algebraic in the field variables and 
is determined solely by TL m i- Due to algebraic complexity, it still is not easy calculate the 
determinant in question for general n. However, there are two special cases where it becomes 



straightforward, n = ±1 (for which the universal coupling derivation has been presented [12]). 
The case n = 1, which corresponds to a contravariant metric density (to the first power) 
and includes FMS, gives det ~ (I - 2)(l - 3) + 1(1 - l)N 2 h^8 i:j + - 2)(l - 1), which 

is nonzero in general (unless perhaps some singular field configurations could give degenerate 
cases). The case n = — 1, which roughly corresponds to a covariant metric density, yields 
det ~ (I — — 2)N 2 + 1(1 + l)hij8 1 ^ + 1(1 — l)(3 l hij(3^ which also is nonzero usually or always. 
For general n in the weak field limit, one gets det ~ (n — 1) (4Z 2 — 4/ + 4) + 4/ 2 - 8/ + 6 / 0, 
which is nonzero for all values of I and n that exclude tachyons. Thus even the infinite spin 
mass case, which OP describe as pure spin 2 [10], does not behave like the Marzban- Whiting- van 
Dam spin 2 case, which has tertiary and quarternary constraints, for a total of 10 second class 
constraints at each point, leaving 5 degrees of freedom (spin 2). The OP theories have 8 second 
class constraints and thus so have 6 degrees of freedom (spin 2 and spin 0), in agreement with 
Boulware and Deser's conclusion that any massive variant of the (nonlinear) Einstein equations 
must have 6 degrees of freedom [17], even if the mass term would yield 5 degrees of freedom with 
the linearized Einstein equations. Thus the algorithm terminates without tertiary constraints, 
much as it does for Proca's massive electromagnetism. The lapse N and shift (5 % are determined 
by hij, 7r u and r)^. Thus the Hamiltonian density H = NHq + (5 %/ Hi + H m i + • • • (constant term 
and total divergence omitted) can be written partly on-shell as Ti f» H m i — N ' — (3 t + . . . , 
which (apart from the discarded divergence that integrates to for Yukawa exponential fall-off) 
can be expressed as a fairly simple algebraic function of the two metrics without the momenta. 
For the FMS theory (n = 1, 1 = 1), H « ]f^(^r - 1) (plus a divergence). If the claimed 
negative energy instability exists, numerical relativists might detect it while working in spherical 
symmetry. 

4. Causality, Two Metrics and Gauge Freedom 

Massive gravity is special relativistic because r]^ u is observable in principle, being present in the 
field equations. The theories's symmetry group is the Poincare group (or the conformal group 
for massless spin 0). Although nongravitational test bodies follow geodesies of g^ u , gravity also 
sees rj^. Thus the propagation of gravitational or matter radiation is subject to the usual special 
relativistic argument that propagation outside the null cone of r]^ v in one inertial frame entails 
backwards causation in another, which presumably one wants to avoid. It follows that either 
massive gravity is acausal, or the light cone of is nowhere outside the null cone of 7]^ u (the 
condition of ??-causality) [19,23,24]. (How seriously one takes this issue depends on whether 
the Minkowski background is taken as part of fundamental physical law or an approximate and 
contingent description of phenomena in our region of space-time. In the former case one might 
hope to quantize using the background's null cone for defining the equal times in equal-time 
commutation relations.) Causality difficulties for higher spin fields are not unprecedented. Long 
ago Velo and Zwanziger, studying the spin | field, found that "[t]he main lesson to be drawn 
from our analysis is that special relativity is not automatically satisfied by writing equations 
that transform covariantly. In addition, the solutions must not propagate faster than light." [25] 
One proposed solution to the causality problem for massive gravity is to stipulate that solutions 
with the wrong relationship between the two null cones are ipso facto unphysical [20]. This 
strategy is reasonable, as long as obviously physical solutions satisfy the causality criterion. 

However, for massive gravity (including massive spin here), the static gravitational field 
from sources decays exponentially in Yukawa form, whereas radiation decays only as K Far from 
sources, gravitational radiation will resemble a linearized gravity plane wave, which violates rj- 
causality [19], and the static field from the source will have decayed away exponentially and 
so be negligible. In response Chugreev has invoked the infinite distribution of matter in some 
standard cosmological models [26]. While this move might perhaps save causality in the actual 
world, it hardly rescues other presumably physical solutions, such as finite-range analogs of the 



Schwarzschild or Kerr-Newman solutions, or solutions with a radiating localized source. Perhaps 
the whole universe is in fact filled with matter, but that is certainly not a necessary truth; it 
might well have turned out that matter was bounded. 

If rejecting bad solutions by hand leaves too few, the alternative [19] would seem to involve 
the introduction of gauge freedom so that the relationship between the two null cones is not 
rigidly fixed by the physical state of the world. Several ways to convert massive GR to a gauge 
theory might be considered. One way is by parametrization, that is, the introduction of clock 
fields [27, 28]. In effect one turns un- variational Cartesian coordinates into variational clock fields 
by replacing r}n U in field equations (but not in the boundary conditions or notion of causality) 
in terms of clock fields X A by r}^ — > X a , i1 t]abX b , u , where t]ab = diag(—l, 1, 1, 1). For the 
n = 1, 1 = 1 FMS theory, C then looks like GR plus 4 minimally coupled scalars (one with the 
wrong sign), but the boundary conditions are different. After a change of variables, the resulting 
gauged massive GR resembles Proca electromagnetism gauged with Stueckelberg's trick. For 
other massive gravities, the outcome will be more complicated. Presumably other conversion 
technologies, such as BFT or Gauge Unfixing, could also be used. 

The gauge freedom could be used to make the correct null cone relationship happen 
in suitably many cases [19]. The naive gauge transformations act on the two metrics as 
9fiu —> G £ ig fiu ,rj fiu — > r]^ v . One can then redefine gauge transformations so that only 77- 
causality respecting states are connected by gauge transformations. Then 77-causality holds 
by construction. Thus those transformations that connect a state violating 7)-causality and one 
satisfying it are not truly gauge transformations. In other words, g^ u must respect 7?-causality, 
so not every vector field ^ generates a gauge transformation for every g^ u and rj^u- Gauge 
transformations can be identified not with a generating vector field only, but with an ordered 
triple involving the metrics: (e £ t , rj^, g^u), where g^ and e £ ^g fJil/ satisfy 77-causality. Two gauge 
transformations (e £ ^ , 7/2, 52), (e 5 , <7i) can be composed to give a new gauge transformation 
(e £ * ,rj2, 52) (e^ ,rji, gi) = (e £ *e £i ,rii,gi) if and only if the flat metrics match (772 =171) and 
consecutive curved metrics match (52 = e *gi-) Not every pair of gauge transformations can be 
composed, so gauge transformations form not a group, but a groupoid (in the sense of Brandt). 
According to Ramsay, "[a] groupoid is, roughly speaking, a set with a not everywhere defined 
binary operation, which would be a group if the operation were defined everywhere." [29] The 
formal definition is expressed by Renault [30] as follows: a groupoid is a set G with a product 
map (x, y) — > xy : G 2 — > G, where G 2 C G x G is the set of composable ordered pairs, and the 
inverse map x — > x -1 : G — > G is such that: 

(i) (x-^-^x, 

(ii) if (x,y) and (y,z) are elements of G 2 , then (xy,z) and (x,yz) are elements of G 2 and 

(xy)z = x(yz), 

(iii) (x _1 ,x) G G 2 , and if (x,y) G G 2 , then x~ 1 (xy) = y, 

(iv) (x,x~ x ) e G 2 , and if (z,x) G G 2 , then (zx)x^ 1 = z. 

Thus there is no single identity element for all elements, but an element times its inverse gives 
one of many little identity elements. There are as many little identity elements as one needs; 
they behave in the expected fashion. 

By construction, gauge transformations as defined here preserve the qualitative relationship 
between null cones (g inside 77), while quantitative part is gauge fluff. (Analogously, Motter 
restricted coordinate transformations in principled way, so "relativistic chaos is coordinate 
invariant": the Lyapunov exponent's sign is invariant, though its value is not [31].) This 
relationship is based described using a generalized eigenvalue formalism based on a Segre 
classification of g„v with respect to r)^ [19]. In cases satisfying ^-causality (and not arbitrarily 
close to violating it), there exist four linearly independent eigenvectors with real positive 
eigenvalues, one of which eigenvectors is timelike and has the smallest eigenvalue. (For 



generalized eigenvalue problems with an indefinite metric, or alternatively for an eigenvalue 
problem with an asymmetric matrix, the usual theorems do not apply.) Thus gauge 
transformations can rotate and boost the eigenvectors in various ways and change the 
eigenvalues, but cannot change the number or type of eigenvectors and eigenvalues. Thus it 
appears that massive gravity need not be acausal, but achieving causality is nontrivial. 
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